DEC-31-01 16:46 F rom : ENG I NEER I NG 



5086503327 



T-808 P. 01 Job-448 



Docket No. 20243/25 

METHOD FOR FINDING CONTOURS IN AN IMAGE OF AN OBJECT 

5 FIELD OF THE INVENTION 

The invention relates to machine vision, and more particularly to machine vision 
inspection methods. 

BACKGROUND OF THE INVENTION 

Various manufacturing processes and other industrial processes require 
10 inspection of manufactured and processed parts to identify and/or classify defects. 
H Automated inspection methods using machine vision techniques are commonly 

g used to perform such inspections. Due to the vast number of possible surface qualities 
W and textures, machine vision applications that are useful for inspecting certain surface 
Jj types may not be well suited for inspecting certain different surface types. Variations in 
fl image feature contrast and reflectivity, for example, can hide critical defects in a 
g machine vision image. 

flj Optical fiber manufacturing operations, for example, use various machine vision 

inspection techniques for quality assurance or in-process inspections of fiber ends. In 

p fiber-optic cable production, once a fiber has been cleaved, It is polished to ensure its 

3© end face is smooth and clean. During the polishing process, the fiber is typically placed 
against an abrasive disc that polishes the fiber end surface. This process can cause 
defects, which if undetected can block the proper transmission of light through a fiber. 
Most fiber end defects can be categorized into one of four categories: spots, scratches, 
cladding boundary defects, and cracks. Lightwave signals that are transmitted through 

25 optical fibers are degraded when photons that make up the light wave encounter such 
defects and are deflected. While certain defects are considered acceptable and will not 
interfere with transmission of lightwave signals, others - especially those near the core 
of the fiber - may be critical. Fiber end defects must be identified and classified during 
the manufacturing process so that the fibers can be cleaned, repolished or discarded 

30 entirely, depending on the size and location of the defects. 
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Optical fiber manufacturers have traditionally relied on manual inspection to 
detect end surface defects that remain after polishing. One typically performs such 
manual inspections by inserting a cable into a fiber microscope, which is also called a 
fiberscope, and analyzing a magnified view of the fiber end connector (which includes 
5 the fiber end and ferrule). Although manual inspection methods are expensive, the 
alternative, i.e., machine vision inspection of optical fiber end surfaces, has proven 
difficult to implement. Optical fiber end surface inspection has presented many 
obstacles to the application of automated inspection techniques using machine vision 
systems. While standard machine vision methods are adequate for many industrial 
10 machine vision applications, highly specialized operations, like inspecting polished fiber 
_ ends, often require specialized vision methods. A machine vision method implemented 
in software and/or hardware is typically called a vision "tool." 

Scratch defects on fiber end surfaces appear as straight or curved lines in an 
acquired image. Machine vision systems used for inspecting fiber ends detect these 
lines to locate defects. The lines often have very low contrast and are speckled with 
noise and texture artifacts and therefore do not always appear continuous to common 
edge detection techniques that use local metrics or small kernel sizes. Such contours 
that appear discontinuous in an image are particularly difficult for machine vision 
m systems to identify. These straight or curved contours are the image features that are to 
be identified and located when inspecting fiber ends for scratches. 

One common technique well known in the art of computer vision is called 
binarization, where pixels in an image are classified into two groups. One group typically 
consists of pixels with a grey level value above a threshold and the other consists of 
pixels with a grey level below the same threshold. Once pixels are labeled as belonging 
to one of two groups, a technique called connectivity analysis can be performed. 
Connectivity analysis considers pixels that touch each other either at the sides, top or 
bottom or diagonally at the comers to be connected. Groups of connected pixels are 
commonly referred to as blobs. This technique is commonly referred to as blob analysis. 
Once blobs are extracted from an image, they can be analyzed by calculating various 
metrics, including but not limited to area, first and second order moments, location, 
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orientation and perimeter. The metrics could then be used to identify scratch-like 
features in an image. 

The advantages of binary techniques such as blob are their simplicity and high 
speed. However, the major shortcoming of blob and other binary techniques is that a 
5 grey level threshold must be calculated, in actual applications, a separate processing 
step must be run to determine a binarization threshold. This is not always easy to do 
given that there may be many confusing features in the image, in many cases, 
especially in fiber end inspection, a single threshold may not exist due to low frequency 
intensity gradients across the image caused by the backlighting effects and the 
10 existence of the fiber core. Another shortcoming of binary techniques is that they only 

provide whole pixel accuracy. Also, scratches can have low contrast thus making it vary 
f'J difficult to find a usable threshold. When the texture and noise in the image has intensity 
O values that vary from the mean of the background by as much as the contrast of the 
jji scratch itself, a binarization of the image will include many extraneous pixels in the 
% same group as the scratches. This extraneous data must then be factored out when 
m recognizing and localizing scratches. 

f Another common method of image analysis used in industry is referred to as 

111 template matching, in which a template is compared with like-sized subsets of the 

hi 

f ■ image over a rang© of positions. The position having the greatest match with the 

It) template is determined to be the position of the object in th© image, a numeric 
representation of the degree of match can be generated and used for inspection 
purposes. One disadvantage of template matching methods is that they require a 
training step wherein a model image or template of objects to be located is generated 
and stored. For the inspection of fiber end surfaces, it is very difficult to generate 

25 models of defects such as scratches due to their variability. One would have to create 
models of potential scratches at many possible angles, lengths and curvatures. All 
these models would then have to be searched for in an image. This type of exhaustive 
technique is impractical since it would not run in a reasonable amount of time. There are 
many different types of template matching techniques used in computer vision euoh as 

30 matching with binary images, matching with grey scale images and normalized 
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correlation which can handle differences in the mean pixel grey level values between 
model and image. All of these variations on template matching suffer from the same 
shortcoming as mentioned above. 

A method of geometric pattern matching was developed to overcome the 
5 limitations of image-intensity-based template matching methods by representing objects 
as geometric shapes independent of shading and independent of a discrete grid. The 
geometric pattern matching method can tolerate and measure wide variations in 
orientation and size of objects in an image, can be performed with high speed and high 
accuracy, and is able to tolerate severe shading variations. An exemplary geometric 
1Q pattern-matching tool called PatMax® is available from Cognex Corporation of Natick, 
Massachusetts. 

q The geometric pattern matching technology used in PatMax includes methods for 

y feature detection, training, affine searching, pose refinement, and inspection. Feature 
ill detection produces a geometric description of object boundaries as an ordered list of 
|f boundary points that lie along contours separating dissimilar regions in the image. 
W Unlike traditional edge detection, which simply marks certain pixels as edges, each 
J s , boundary point specifies positions and orientation as real-valued coordinates, 
f(* The geometric descriptions of objects produced by feature detection can be 

U translated, rotated, scaled, stretched and otherwise transformed with no loss in fidelity. 
|| PatMax uses a six-degree-of-freedomi affine transform which includes two degrees of 
freedom of translation and one each of rotation, scale, aspect ratio and shear. Each 
non-translation degree of freedom is specified as either a fixed value or a range of 
values to be searched. A fixed value for a degree of freedom allows a transformed 
version of the trained pattern to be found with no time penalty. For example, a single 
25 square pattern can be trained and used to find rectangular objects at a variety of known 
sizes and shapes by setting appropriate fixed values for the scale degrees of freedom. 

In order to recognize an object in an image, analysis tools, such as PatMax, must 
first be trained by acquiring and storing an image of a representative object or model. 
The model can be developed as an image of an actual object or can be synthetically 
30 constructed using CAD tools. During runtime operation, the maohine vision system 
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acquires an image that may include the object of interest at any scale or orientation 
relative to the viewing device. The analysis tools will recognize the object in the image 
and compute the degree of rotation, scale and translation of the object in the image 
relative to the trained model. This computation is sometimes referred to as an 
5 alignment of the object. Alignment may also allow other degrees of freedom. 

While PatMax has some flexibility in performing a rotation and scale invariant 
search, it is not optimal for finding scratches on fiber ends. One reason is that scratches 
that appear as lines are very self-similar along the line itself. Geometric pattern 
matching techniques will have difficulty localizing line models along lines in an image. 
10 Scratches may be of any length and finding lines of varying lengths with a single line 

model of a fixed length that can be scaled has problems. The tool will find a good match 
g in the dimension perpendicular to the line, but along the line there can be many 
□ potential matches, all with an equal score. The many good matches can make the tool 
m take a long time to execute and when the line model is allowed to change scale, the 
t| match calculation can become numerically unstable. PatMax is also susceptible to the 
y problems with lower-level edge detectors as described next, PatMax also requires 

moderately sharp edge features with reasonable contrast, 
ru Also, both template matching and geometric pattern matching techniques 

y k assume that the object to be found will have a consistent appearance, and this is 
M usually not the case for scratches in a fiber end face, For example, the acralohes can 

S!SS 

have a range of curvatures, and some may even have sharp turns. Thus, there is 
typically no affine transformation that could match a given curved scratch pattern to an 
actual curved scratch, since the shape may be different. 

Scratches can be particularly difficult to detect because they usually have low 
25 contrast with respect to a fiber end surface. Various lighting methods can increase the 
contrast of scratches and cracks during image acquisition so that they can be more 
easily detected. For example, viewing a surface directly while applying side lighting, 
viewing a surface at an angle with lighting from the opposite angle, or transmitting light 
through the fiber core can make soratches and cracks more visible. These types of 
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lighting and viewing angle changes can enhance the quality of the scratch contrast but 
still will not be enough to make simple vision techniques work. 

Another method of detecting scratches is to project the image of the fiber end 
surface to form a one-dimensional (1 D) projection image. This type of projection 

5 technique is well known in the art and involves summing the pixels along rows of an 
image such that the resulting image is a one-dimensional image that has as many 
elements as rows in the source image and each element contains the sum of all the 
pixels in a row. Lines that are lighter than the bacKground and that are horizontal will 
appear as peaks in the 1D projection image. The location of the line is then determined 

10 from the row in which the peak occurred. This is gsneralizable to ail angles where the 
image is rotated into position such that the candidate angle aligns to the horizontal 

[IJ rows. The shortcomings of this type of projection technique are that the length of lines 

0 cannot be determined and that only straight lines can be found. Projections are also 

W 

III very susceptible to confusion by bright spots. A bright spot appears on a projection 

i| exactly like a line with lower contrast. 

|i) Many machine vision methods require extracting what are called primitive or low- 

J level features and then building more complex objects or features from those simple 

III ones. The most basic low-level feature is a single pixel that has a grey value. A more 

1 = i 

T: complex feature would b© an edge feature that results from applying a Sobel or Roberts 

20 operator that is well known in the art. These are small kernel operators (3X3 and 2 X 
2) that return the direction and magnitude of the intensity gradient in the image at the 
location of the center of where the operator is applied. These operators are known in 
the art as edge detectors. While the previous techniques discussed (blob and template 
matching) analyzed the pixels of an image directly, the following techniques that have 

2S been used for scratch detection rely on processing the results of edge detectors that are 
called edge features, 

Commonly used edge detectors have several shortcomings when used to detect 
scratches on fiber end surfaces. Scratches can be irregular at a small scale (1 or 2 
pixels) but still appear uniform and straight at a larger scale. Scratches can also be faint 

30 and have low contrast. Since the support of the edge operator is no larger than 3X3, 
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only sharp higher contrast edge features will be detected. Also, the small-scale 
irregularities of the scratches will generate many false edges and miss many valid 
edges such that the subsequent processing steps, such as chaining, have too many 
features to consider, thus making such processing run slowly and produce erroneous 

5 results. 

Often the problem of detecting and tracing scratches becomes a problem of 
grouping features such as edge features to form higher-level contours. One well known 
method of grouping low-level features into contours or lines that can be called scratches 
is to perform a Hough transform on the image space. 
10 The Hough transform aids in the detection of lines or arcs of a particular shape. 

u Each arc or line has some free parameters that, when specified precisely, define the arc 
0 or line. The free parameters constitute the transform domain or the parameter space of 
[J the Hough transform. Pixels or other features tn an image are mapped into the Hough 
U parameter space, which is quantized into discrete bins called accumulator cells (a 

6 multidimensional array). Peaks in the accumulator array indicate that a line or an arc 
w may exist at corresponding points in an image. See Robert M. Haralick and Linda G. 
H Shapiro, Computer and Robot Vision. Volume II . p. 1 84 (1 993) . 

}}{ When applied to identify straight lines, an implementation of the Hough 

transform, called "line Hough," is used; typically, edge features of an image are used as 

16 the immediate inputs to line Hough. The line Hough is limited to identifying straight lines 
and, by design, detects lines that are very sparse. For example, two short segments on 
opposite sides of an image that happen to be co-linear will be reported as a line even if 
they do not correspond to the same continuous line segment. The line Hough has 
additional limitations including the generation of large numbers of candidates that must 

25 each be separately verified. 

A more general Hough transform is used to find curves of specific shapes and 
sizes. Arbitrary curves or contours, such as some fiber end scratches, however, cannot 
b© found by any implementation of the Hough transform. Furthermore, the quantization 
of Hough space into accumulator cells can introduce significant error in the reported 

30 location of candidate lines or curves. Hough transforms also typically rely on lower level 
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edge detection such as Sobel or Robert transforms and therefore suffer all of the 
limitations of those tower level operations. 

Unlike Hough transform methods, chaining methods operate locally and do not 
connect unrelated pixels that may happen to fall on a predefined contour. However, the 
5 local scope of chaining methods can prevent them from bridging small gaps to connect 
edge segments that do belong to the same contour. Such small gaps occur quite 
frequently in images and are especially problematic in low contrast defects such as 
scratches in optical fiber end surfaces. 

U.S. Patent No. 5,995,212 to Dar et al, (hereinafter referred to as "the Dar 
10 patent"), which is hereby incorporated by reference in its entirety, describes a method of 
u inspecting an optical fiber for scratches using a windowing technique which implements 
O the Hough transform. The method according to the Dar patent disadvantageous^ 

IT"! 

[J requires extensive pre-processing to isolate regions of suspected scratches in which to 

Ifl apply the Hough transform. More significantly, the method according to the Dar patent 

1)3 suffers the same disadvantages of the general Hough transform in that it connects 

5 ,. '£ 

f widely separated pixels that may not be part of a common contour. 

H Another approach to identifying contours in an Image Is chaining of edge 

JJj features. U.S. Patent No. 5,179,419 to Palmquist et al. (hereinafter referred to as "the 

M Palmquist patent"), which is hereby incorporated by reference in its entirety, describes a 

p£> method for detecting, classifying and quantifying defects including scratches. The 

method taught by the Palmquist patent breaks edge features in an image into segments 
to distinguish overlapping defects. Connected segments are decomposed and labeled 
to group pixels based on connectedness. Segments are connected to follow curves and 
break at sharp corners. Small segments caused by noise are discarded. The method 
25 of chaining edge segments taught by the Palmquist patent uses well-known filters such 
as Sobel filters. The chaining steps performed by Palmquist are much like base 
chaining functions performed by several image analysis tools, including PatMax. 

The method disclosed by the Palmquist patent disadvantageous^ requires the 
acquisition of an optimal focus image and auxiliary images behind and in front of the 
30 optimal focal positions. Since the method in the Palmquist patent uses simple Sobel 
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edge detection, it suffers from all the shortcomings associated with this type of feature 
extraction technique as described previously. 

U.S. Patent No. 6,167,150 to Michael et al, (hereinafter referred to as "the 
Michael patent"), which is hereby incorporated by reference in its entirety, discloses a 

5 method of detecting edges in an image that represent scratches in disk drive heads, 
The method disclosed in the Michael patent detects edges in the image and determines 
if edges are connected using the magnitude and orientation of the edges. After 
eliminating edges based on magnitude, the orientation of neighboring edges are 
compared and are determined to form connected edges when the angle of the 

10 neighboring edges are within a pre-determtned range of each other. 

The method disclosed in the Michael patent works well to detect very thin 

jssif 

O scratches in the order of one pixel wide because it is designed to locate edges. Wider 

scratches, in the order of five to ten pixels wide, however, are recognized as two 
if! separate entities because each edge of such wider scratches is separately detected. 
J| This redundant reporting of found features requires additional processing, for example, 
to associate two edges with a common scratch feature. In addition, fiber scratches that 
are irregular, low in contrast, or formed by a series of dots that are not necessarily 
connected, do not form continuous connected edges and will not be detected by this 
method. In fact, edge detectors may not deteot a string of dots at all. 



■ 



La 



SUMMARY OF THE INVENTION 

The invention provides a general method for finding contours in an image of an 
object. In a particular embodiment, the invention finds contours that are straight lines or 
are curves with a large radius of curvature, tn an illustrative embodiment, the invention 
25 detects scratch defects in a polished end-face of a fiber-optic cable. 

In accordance with the principles of the present invention, features are extracted 
from an image, those features are chained using local connectivity analysis with 
chaining constraints to form contour segments, contour segments are successively 
selected as candidates for joining together to form contours using hash functions to 
30 greatly reduce the number of candidates that need to be considered, and contour 
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constraints are used to choose among the candidates. An example of a contour 
constraint useful for finding straight lines with relatively small internal gaps would be that 
the endpoints of the two contour segments would need to be close and the overall RMS 
error of a line fit through the points of both segments would need to be small. 
5 In an illustrative embodiment, the features are edgelets produced by generating a 

magnitude image and angle image, using truepeak (non-maximal suppression along the 
gradient) to determine approximate edge location, and interpolation of magnitude values 
to produce a location with subpixel accuracy. Edgelets are small segments of edges 
with magnitude, orientation, and approximate 2D location. In another illustrative 

10 embodiment, a Sobel edge detector is used to generate the magnitude and angle 

l. images. In another illustrative embodiment, a set of oriented filters is used to generate 
5 the magnitude and angle images. 

hi In an illustrative embodiment, the present invention greedily combines contour 

J[{ segments into larger contours (which may have gaps). To avoid increasing complexity, 

11 two contour segments are considered for combining based primarily on the minimum 
distance between the endpoints of the segments, A decision on whether to combine the 

h* segments Is then based on a metric that represents the probability that both segments 
jjj come from the same contour. To find straight lines, for example, the RMS fit error of a 
tl line fit to aH P oints °f both segments can be used as a metric. To find slightly curved 
|1 contours, the radius of curvature within some fit error can be computed and used as a 
metric. 

This process can be iterated, optionally varying the constraints on the metrics, to 
allow efficiently building up of higher order features. For example, once an initial pass is 
made which considers only very close contour segments, a second pass can then be 
25 made where a greater separation is allowed for groups of segments of a certain length, 
perhaps also placing tighter constraints on the angles. 

This invention overcomes limitations of the prior art by providing methods for 
effectively analyzing surface inspection images to identify low contrast contours and 
discontinuous contours of arbitrary shape. 
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The present invention makes contour identification in machine vision images 
more efficient and robust. Curves that cannot be found by other techniques can be 
found by methods according to the present Invention. Disadvantages of the Hough 
transform including false correlation of sparse contour segments are overcome 

5 according to the present invention. Quantization error effects that plague known Hough 
transform methods are also avoided by the method of the present invention. The 
present invention further overcomes ithe disadvantages of known chaining methods by 
bridging local gape in low contrast defects. The present invention further overcomes the 
disadvantages of known global contour finding methods by using hash functions to 

10 reduce the computational complexity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

UJ The above and further advantages of the invention may be better understood 

in 

*g from the following detailed description, in conjunction with the following figures, referred 

# to herein and constituting a part hereof, wherein: 

UJ 

Figure 1 is a schematic block diagram of a conventional machine vision system; 
|f. Figure 2 is a top-level flowchart of the invention; 

UJ Figures 3A and 3B, when placed together, form a flow chart illustrating the steps 

is* 

U performed to implement at least one embodiment of the present invention; and 
2Q Figure 4 is a graphical representation of an Image space divided into spatial hash 

bins according to at least one embodiment of the present invention. 

DETAILED DESCRIPTION 

The method according to the present invention locates contours In an image and 

25 is particularly useful in detecting defects suoh as scratches in optical fiber end surfaces. 

The method provides enhanced detection of such scratches, which typically appear as 

faint sets of locally disconnected features having low contrast in machine vision images. 
Figure 1 illustrates in schematic form an example of a machine vision system that 

can utilize the process of the present invention. The example system demonstrates the 
30 inspection of the end-face of an optical fiber, but this example is merely illustrative and 
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not restrictive, (n the example system, a fiber microscope 100 examines an optical fiber 
110 held in a mounting device 1 05. The fiber may also be presented to the system by 
other well-known mechanisms, When a fiber is in the mount, an image stream of the 
fiber is created by a camera system 1 1 5 in a conventional manner. Light is provided by 
5 a light source 1 02, and a half-silvered mirror 104 is used to direct the light onto the fiber 
end-face while also allowing the fiber end-face to be imaged by the camera 115. Other 
arrangements of optics and lighting are possible. The image stream created by the 
camera 1 15 is provided to a computer system 120 that processes the Image according 
to the method of the present invention. 

0 The computer system 120 in general comprises an image processing system 125 
and an object processing system 130. The image processing system, in turn, includes 

J a frame grabber 1 35 that creates a series of fixed images from the image stream 
generated by camera 1 1 5. The series of fixed images are provided to a vision tool 140 

.1 

1 that detects and measures scratch defects in the images in accordance with the 

| principles of the present invention. The vision tool may also detect other types of 

y 

y defects or make additional measurements. The results of the vision tool are then 

passed -to the object processing system 130, which controls a selection or manipulation 
system 145. The selection or manipulation system treats each fiber 110 based on the 
results of the object processing 130. For example, if the vision tool finds no critical 
8 defects in the fiber, then the object processing system may direct the manipulation 

.as 

system to transfer the fiber to a bin of acceptable fibers, but if the vision tool finds a 
critical defect, then the object processing system might direct the manipulation system 
to send the fiber off to be polished again, followed by reinspection. The manipulation 
system might consist of just a human operator or might be automated. Many other 

5 actions are also possible, especially when the present invention is used for applications 
other than fiber end-face Inspection. 

Steps in the method for finding contours in an image of an object are shown in 
Figure 2. The process begins in step 200 and proceeds to step 205, where an object is 
presented to the camera 1 1 5, such as by placing a fiber in the mounting device. In step 

0 21 0, an image of the object is acquired as discussed above. 
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in step 215, features are extracted from the image- Many different techniques 
are possible for extracting features that are relevant to contour finding, but the features 
are usually edges or portions thereof or line or cutve segments. However, other types 
of features are also possible and are within the scope of the present invention, For 
5 example, in some applications the contours of interest might be indicated by bright 
pixels, and such bright pixels could be used as the features. 

Processing then proceeds with step 220, where chains are formed from localized 
subsets of features, applying one or more chaining constraints. Typically, the chains 
are formed from edgelets that are adjacent on the pixel grid and that have compatible 
10 angles, and magnitude thresholds may also be used to constrain which edgelets form 
chains, 

3 In many applications, a single contour will consist of two or more chains that 

have gaps between their ends. In order to efficiently find chains that belong to the same 
f! contour, without having to consider all possible combinations of chains, one or more 
lp hash functions are used to compute hash values for some or all of the chains in step 
■M 225. Only chains with identical or simitar hash values are considered as potential 

candidates for belonging to the same contour. For example, one hash value (per chain 
endpoint) might represent the coarsely quantized location of one chain endpoint, and 
| s i similar hash values would be the hash values representing nearby coarsely quantized 
5j6 locations; another hash value might represent the angle or curvature of a chain, and 
similar hash values would be the hash values representing similar angles or curvatures. 
In step 230, a seed chain is chosen and candidate chains for joining with it to form a 
contour are determined based on their hash values. In step 235, chains are chosen for 
combination from amongst the candidates using a set of contour constraints. 
25 in step 240, the chosen chains are joined to form a contour segment. Next, in 

step 245 a determination is made whether to process more chains. For example, 
processing might proceed as long as any chain had not yet been joined to a contour or 
considered as a seed chain, or it might proceed only until at least one contour meeting 
certain criteria had been found. If a decision is made to process more chains, then the 
30 hash values for the newly formed contour segment may be updated in step 250. For 
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example, if the hash values represent the coarse locations of the two endpoints, then 
the contour segment will have hash values representing the two new endpoints, and not 
the endpoints that were connected to form the segment. In step 255, a determination is 
made whether additional hash values need to be computed. Typically, no additional 

5 hash values are needed if all chains had hash values computed originally in step 225, If 
more hash values are needed, processing returns to step 225 to compute those hash 
values; otherwise, processing returns to step 230 to choose a seed chain and identify 
candidate chains for joining to it. Note that the seed "chain" and candidate "chain" may 
also be contour segments at this point in the processing. In other words, contour 

10 segments can be used in exactly the same way as chains at all points in toes processing, 
thus allowing iterative or recursive grouping of contour segments. In a typical 

pj implementation, this property is most easily achieved by representing the initial chains 

O as contour segments, 

lii 

ill If, in step 245, the decision is made not to process more chains, then the current 

ih image of the object has been completely analyzed, and a determination is made in step 

ill 

hi 260 whether to process images of additional objects (or perhaps even additional images 

of the current object, presumably after some modification, such as repolishtng a fiber 
fll end). Processing thus may continue by returning to step 205 to present an object to the 

j*f imaging system or may terminate at step 265, 

f'"~ 

§1 Many feature extraction techniques may be used for step 215. In an illustrative 

embodiment, a Sobel edge detector generates gradient magnitude and gradient angle 
values for the image or region thereof, then non-maximum suppression using the 
gradient direction (also known as "truepeak") is performed to produce truepeak 
edgelets, and finally subpixel interpolation is performed to produce subpixel edge 

25 locations for the edgelets. Truepeak may also use a magnitude threshold to eliminate 
weak edges. These techniques are known to those skilled in the art. Note that 
truepeak responds only to edges and not to uniform gradients such as those caused by 
nonuniform diffuse lighting. Such edgelet generation Is known to those Skilled in the art, 
and one implementation is an edge detector available from Cognex Corp. Sobel Is 

30 particularly useful in applications where the edges are fairly sharp. In another 
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illustrative embodiment, a set of oriented filters, such as Gabor filters, is used to 
produce magnitude and angle information which is then processed as above to form 
oriented subpixel features. Oriented filters may be used for this purpose according to 
the method described in U.S. Patent Application No. by Yusuf 

5 Akgul et al. entitled "Method for Extracting Features from an image Using Oriented 
Filters" and filed on December 28, 2001, which is hereby incorporated herein by 
reference in its entirety. Note that the features may actually be line segments or curve 
segments and not only edge segments; in particular, a line with substantial width may 
be represented by single "linelets" instead of pairs of edgelets, thus making detection of 

10 the line as a single contour (instead of as a pair of contours) much simpler. For 

simplicity, the term "edgelef will be used to refer to all such features; such usage is not 

H intended to be restrictive. Many other feature detectors are possible, and their use is 

n 

g within the scope of the present invention. 

jjj Large oriented filters are particularly useful for finding contours representing 

0 scratches in an image of a fiber end face. Such scratches are often quite faint and have 

irregular intensities and somewhat diffuse boundaries, and they are thus difficult to 
= detect witn local edge detectors such as Sobel. Various image preprocessing steps 
pj may also be useful when performing feature detection, as is known to those skilled in 
W the art. For example, the image might be smoothed or sharpened using filters. 
|f Additionally or alternatively, the image might be subsampled to increase processing 
f " speed or to change the spatial scale to which the feature detector would respond. 

The feature detection step 215 typically produces very low-level features such as 
edgelets, thus giving an indication as to whether a contour may pass through that pixel 
with its approximate tangent angle at that pixel, However, many of the features thus 
25 produced will actually be the result of noise and not of real contours of interest. There 
are typically very many such low-level features, so that trying to process them all with a 
global contour-finding procedure would be very computationally expensive (because of 
the large number of features) and not very robust (because of the relatively large 
uncertainty associated with such low-level features). Thus the chaining step 220 
30 performs a local connectivity analysis on the low-level features to produce locally 
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connected chains, which are segments of contours. Such a step is usually not sufficient 
in and of itself to find all the contours of Interest, both because many contours appear to 
have gaps in typical images and because local connectivity rules cannot make optimal 
decisions at branch points where chains intersect). Chaining thus has the goal of 
5 producing fewer but more reliable features, called chains, for use by subsequent 
processing. The chains are usually line segments or curve segments. 

Many techniques for chaining are known to those skilled in the art, and all are 
within the scope of the present invention. In order to be efficient, chaining typically 
makes use of the arrangement of the low-level features on the pixel grid and analyzes 
10 small local regions to connect the features into chains. Chaining may use constraints in 
order to connect only those features that seem to form part of a consistent contour. For 
H* example, two horizontally adjacent edgelets which are each horizontal might be 
p. connected to form part of a chain, but two horizontally adjacent edges where one is 
W horizontal and one vertical may not be chained, especially if one wants to break chains 
|| at sharp comers. In general, if the angles of two adjacent edgelets are similar to each 
p g other and somewhat similar to the direction of the displacement between the edgelet 
positions, then the edgelets are considered compatible and it is reasonable to chain the 

Jf; two edgelets together The precise chaining rules may vary. In an illustrative 

Fu 

ui embodiment of the invention, the particular method generates only unbranched chains; 

|8 each element of the chain has at most two neighbors and the chain does nott intersect 

M itself. In this embodiment, branches of the chain are broken into separate chains at any 
intersections. Alternative embodiments of the invention can easily be envisioned 
wherein a large web of unbroken branches remain connected. 

Depending on the nature of the feature detection, the edgelets may have angles 

25 in the range [0, 360) degrees or only in the range [0, 180) degrees. For example, Sobel 
edgelets typically have a 360 degree range because, e.g. 0 degrees indicates a vertical 
line which is dark on the left and bright on the right while 180 degrees indicates a 
vertical line which is bright on the left and dark on the right. For edgelets with a 360- 
degree range, one should ordinarily not chafn together edgelets whose angles differ by 

30 approximately 1 80 degrees. On the other hand, oriented filters which have been tuned 
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to detect lines instead of just edges may have a meaningful range of only [0, 180) 
degrees, since 0 and 180 degrees may each indicate a vertical line without necessarily 
indicating its polarity. For edgelets with a 180-degree range, one should ordinarily 
consider edgeiet angles near 0 and near 180 as being compatible with each other. 
5 In an illustrative embodiment, as part of step 220 a sub-chaining step is 

performed that breaks the chains into linear pieces, i.e., straight line segments. The 
straight line segments are developed in the exemplary embodiment because the defects 
of interest, i.e. scratches, are typically straight lines or curves having large radii of 
curvature that can be constructed from connected straight line segments. The 
10 illustrative embodiment does not, for example, look for contours that have numerous 

sharp turns or which zigzag across an image. Such chain breaking can be easily 
□ accomplished by starting from one end of the chain and iteratively computing the best fit 
y line to the first N points and breaking it at a value of N where the RMS error of the line fit 

exceeds some threshold. One may also want to break the chain at any edgeiet where 
JjjS the edgeiet angle differs from the angle of the fit line by too great an amount. Breaking 
UJ then proceeds to process the rest of the chain, producing additional sub-chains if 
y, necessary. The illustrative embodiment uses an RMS threshold of 0.5 pixels and an 
jj? edgelet-to-line angle threshold of 30 degrees. Alternatively, one might use information 
I from the entire initial chain to determine optimal breaking points, as is known in the art. 
H> The sub-chaining step is merely illustrative and not restrictive, and the sub-chains will 
simply be referred to as chains except where the distinction is necessary for clarity. It is 
also possible to enforce chain linearity during the initial chaining step instead of by 
postprocessing the chains. Some embodiments may not want to enforce any sort of 
linearity constraint on the chains, since for high-contrast contours the chaining may tend 
25 to produce correct contour segments even when they are curved. 

The chains comprise a set of fewer but higher-quality features than did the 
original features. Chains may be thresholded by length so that very short chains, e.g. 
those with fewer than four edgelets, are discarded. They may also be thresholded by 
magnitude (e.g. average or median magnitude of the edgelets) to discard weak chains. 
30 In the illustrative embodiment, the average chain magnitude is used as the basts for 
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magnitude thresholding. Discarding low-magnitude chains is an effective way to 
discriminate against video noise and iow-leve! texture in images. 

In some applications, the chains produced by step 220 may by themselves form 
a sufficient representation of the contours in the image. For example, the contours 
5 might be fairly high in contrast and consistent in appearance so that the edgelets 
representing the contour are continuously connected, or the particular application might 
not require that each maximal contour be detected as a single entity. However, it is 
usually desirable to determine the set of maximal contours, and contours often do not 
appear continuous in images. In particular, scratches in fiber end-face inspection 
10 images tend to appear as low-contrast discontinuous contour segments. 

It is thus necessary to further analyze the chains, which represent contour 
segments, in order to combine and connect them to form the complete contours. In 
particular, it is necessary to determine which sets of contour segments each belong to 
the same contour, and in general this is a very difficult problem. However, it is often 
% possible to specify a set of constraints on the possible appearances of contours so that 
it is possible to determine if two contour segments should be considered part of the 
same contour. The constraints typically also enable choosing the most likely 
interpretation among a set of possible interpretations of a set of contour segments as 
contours. For example, if the desired set of contours consists of straight lines, then two 

% contour segments probably belong to the same contour if they are sufficiently co-linear. 

r** 

Note that at this point, the contour segments could be input to an method such as 
the Hough transform to try to determine likely contours. However, such techniques 
disadvantageously require parameterization of contours, do not allow for more refined 
decisions based on constraints, and often produce a large number of false results in 
25 addition to the desired ones. 

The most straightforward more general approach to determining a set of likely 
contours is to consider every contour segment for compatibility with every other contour 
segment. Such an approach requires 0(N 2 ) comparisons for N contour segments even 
if a greedy approach is used, in which a decision can be made for each segment 
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independently; without a greedy approach, the combinatorics of determining all possible 
sets of compatible contour segments is computationally infeasibla 

It is thus highly desirable to be able to identify which contour segments are 
compatible with a given contour segment without having to consider all of the other 
5 contour segments. According to the present invention, the values of one or more hash 
functions are computed for the contour segments; in an illustrative embodiment, the 
segments are then made accessible through a hash table. Hash tables are well known 
to those skilled in the art, and they allow efficient access to elements with a given hash 
value without having to search through the elements* It is also possible to use hash 
10 functions without actually using a hash table; such use still requires 0(N 2 ) comparisons 
in the overall method but allows each comparison to be done very rapidly. The purpose 
H of the hash functions is to rapidly eliminate most incompatible contour segments without 
p having to examine every other contour segment, and the hash functions should be 

w chosen appropriately. Note that the hash functions do not replace the need for or use of 

ui 

IS contour constraints, but they simply make the application of such constraints 
jH computationally feasible. The reduction of the computation from 0(N 2 ) to a lower order, 
? or even to 0(N 2 ) with a lower overall constant, is merely illustrative and not restrictive, 
J but is meant to suggest that greater efficiency can be obtained by the proper choice of 
W hash functions. 

f| By their very nature, hash functions perform quantization, and such quantization 

^ implies that it is always possible to have two very similar elements that have different 
hash values. As a simple example, consider hashing the x-coordinate of a set of points 
with the hash function H(x) - fioor(x/l 0), i.e. the hash value is the nearest lower multiple 
of io. Then, for example, we have the desirable property that the relatively nearby 

25 values x = 1 .5 and x = 5.4 both hash to 0, while the relatively distant value x = 1 1 5 
hashes to the different value of 11 . Thus, if we want to consider only nearby points as 
possibly "compatible," and we are trying to determine the set of points compatible with x 
= 5.4, then this hash function quickly eliminates most distant points. However, we also 
have the undesirable property that the two nearby values x *= 9.99 and x - 10.01 have 

30 different hash values: H(9.99) = 9 and H( 1 0.01 ) = 1 0. With this hash function, however, 
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we can easily determine which other hash values are "similar" to a given hash value 
(namely H-1 and H+1), and then we can consider elements with similar hash values as 
well as those with identical hash values in order to eliminate the quantization problem. 
Similarly, if one wanted to hash angles with H(A) = floor(A/20) where A is in degrees, 
5 one would note that H+1 and H-1 were similar hash values, with the further specification 
that such addition and subtraction be performed modulo 18 (= 360/20). For example, 
the angle A=359.99 would have H(A)=17, and hash values 16 and 0 would be 
considered similar. Clearly, one could also consider a larger range of hash values if 
one wanted to allow for increasing dissimilarity. In an illustrative embodiment, the hash 
10 f unctions are chosen such that elements which are 3omewhat similar have similar hash 

values, and it is easy to determine the set of hash values which are similar to a given 
p hash value. In general, for an arbitrary metric M where the similarity of two elements is 
f\ approximately a linear function of ith - m 2 (possibly using modulo arithmetic), then one 
«1 can use the hash function H(M) = f|oor(M/Q) where Q is a quantization value. As will be 
|i clear to those skilled in the art, many other hash functions are possible, and the use of 
W any of them is within the scope of the present invention. 

m it is often the case that even though contours appear discontinuous, the gaps 

yi between the contour segments of a given contour are relatively small, and in such 
1=1= cases two contour segments are unlikely to belong to the same contour If their 
|0 endpoints are distant, even if they seem otherwise compatible. In many applications, 
for example, if two short line segments are on opposite sides of an image but happen to 
be co-linear, they should not be considered part of the same contour (unless, of course, 
there is an intervening set of segments between the two). In such cases, it is desirable 
to use a hash function of the endpoint locations that represents a tiling of the space. 
25 For simplicity, a square or rectangular tiling can be used, although use of any tiling is 
within the scope of the present invention. 

In an illustrative embodiment, the hash function, which will be referred to as the 
"spatial hash function," is 

H s (x, y) = floor((x - Xo)/ s«) + N x * floor((y - y 0 )/ Sy) 
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where (x, y) are the coordinates of one endpoint of a contour segment, (x 0 ,y 0 ) are the 
coordinates of some arbitrary origin, s x and s y are the tile size in the x and y directions 
respectively, and N* is the number of hash values in the range needed to cover all x 
coordinates. For example, if one first computes the bounding box of all the endpoints, 

5 then one might choose xo to be the smallest (i.e. closest to negative infinity) x 

coordinate of all the endpoints and y 0 to be the smallest y coordinate, in which case N x 
= ceii((x max - x 0 )/s x ), where Xma* is the largest x coordinate of all the endpoints. Note 
that *f ioor(x)" and M ceil(x)" are the functions usually denoted In mathematical literature 
as LxJ and fxl, respectively. With this hash function, the hash values which are similar to 

10 a given hash value h (i.e. which represent elements in tiles adjacent in a 3x3 

neighborhood to the given element) are h - 1 , h - N* - 1 , h - N x , h - N x + 1 , h + 1 , h + N* 

u + 1 , h + N x , and h + Nx - 1 . Note that for tiles on edges or comers, some of these hash 

o 

P values can be immediately neglected, 

This hash function H s is applied to each of the two endpoints of a contour 
iS segment, so that a contour segment may have two hash values; a contour segment 

may have only one hash value if both of its endpoints lie in the same spatial bin. The 
« segments are then made accessible by a "spatial hash table," which is a hash table 

hz'z 

fU where a contour segment is accessible by each of its hash values. Note that additional 
j*| hash functions could also be applied to further reduce the number of possible 
5§ candidates for joining to a given contour segment, such as a hash of the angle of a 
f " contour segment (assuming the segments were approximately linear). However, in 
practice, the spatial hash function alone suffices to make the number of candidates 
computationally feasible. 

An illustrative embodiment of the invention is illustrated in more detail in the 
25 flowchart of Figure 3 and the graphical representation of Figure 4. First, in step 305, the 
input image is processed to produce a magnitude image and an angle image, using 
either a set of large oriented filters or a Sobel edge detector. Next, in step 308, 
truepeak and subpixel interpolation are applied to the magnitude and angle images to 
produce edgelets that have subpixel accuracy; this step may also use magnitude 
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thresholding. Steps 305 and 308, taken together, correspond to the general feature- 
detection step 215. 

Next, in step 31 1 , connectivity in the 3x3 pixel grid sense and angle constraints 
are used to connect edgelets into unbranched chains. In step 31 4, the chains are 
5 broken into linear sub-chains based on RMS error of a line fit to subsections of the 
chain. At this point, some sub-chains may be discarded if they are very short (e.g. 
contain less than 4 edgelets) or have a low average magnitude. Steps 31 1 and 314, 
taken together, correspond to the general chaining step 220, with the sub-chains of step 
314 corresponding to the chains of step 220. 

to Next, in step 317, the sub-chains are considered to be general contour 

segments, in order to make the general iterative or recursive structure of the method 

u clearer. 

5 The bounding box of all the endpoints of the contour segments is computed in 

jj step 320. In step 325, a spatial hash table is constructed using the bounding box and a 
%$ bin size, and all the contour segments are made accessible through the hash table 
vj based on the spatial hash values of each of their endpoints. Note that the computation 
m of the bounding box merely facilitates determining the range of possible hash values 
U and may simplify construction of the hash table for some implementations. Also note 

HI 

that the bounding box should be enclosing but need not be tight; thus, one could simply 

S© use the boundaries of the image as the bounding box to avoid computing a tight 

2 bounding box. 

For use in step 325, a bin size needs to be chosen for the spatial hash bin grid 
50. The size of the bins 52 can be chosen depending on the size of allowable gaps 56 
between sub-chains 58 that can be part of a single defect 60. If the bins 52 are too 

25 large, then the following steps of combining sub-chains 58 become too computationally 
intensive, if the bins 52 are too small, then only small gaps 56 will be allowed in 
reconstructing a defect 60. In the extreme cases, for example, a single bin 52 
comprising an entire image would not reduce the set for comparison and would result in 
a maximal 0(N Z ) number of comparisons. If the bins 52 were comprised of single pixel 

30 54 locations then only sub-chains 58 having ends within one pixel 54 of each other 
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would be candidates for joining. The illustrative implementation of the invention works 
well with bins 52 that are ten by ten pixel 54 squares. It should be noted that small sub- 
chains may be associated with only one bin 52, for example, if both ends of the sub- 
chain fall within the same ten by ten pixel 54 bin 52. Larger sub*chains 58 and small 
5 sub-chains 58 that cross-bin boundaries 62 are generally associated with two bins 52: 
one for each end. 

The illustrative embodiment chooses the bin size based on the maximum gap 
distance G that is allowed between endpoints of two contour segments that can be 
considered for Joining, and in fact chooses the bin size to be equal to the maximum gap 
10 size and considers all bins in a 3x3 neighborhood to be similar. With these choices, it is 

clear that all contour segments whose endpoints are within the maximum gap distance 
frt will be found. Some contour segments at greater distances (possibly up to 2G^2) may 
pi also be found, and these may then be discarded by a more precise calculation of 

fees' 

Uj endpoint distance. One could also use a smaller bin size and a larger similarity 
|| neighborhood in order to decrease the number of candidates found and possibly to 

*4l decrease or eliminate the need to perform a more precise calculation in order to discard 

id 

r some candidates, especially if the relevant constraint were only approximate anyway. 
As part of step 325, the bin numbers of the endpoints of each contour segment 
Ul are determined, and the contour segment is assigned to those bine (or to a single bin if 
Sf the two bin numbers are the same), The bin number is simply the value of the spatial 
M hash function Hs described above, using the bounding box of all the sub-chain 

endpoints to obtain x 0l yo, and Xmax and setting s x = Sy= G. This step corresponds to 
hashing the sub-chains 58 into their respective bin or bins 52 based on the location of 
their endpoints, and also to step 230 of Figure 2. 
25 in step 330, one contour segment is chosen as the seed segment. Usually the 

longest segment is chosen, but one might also consider the segment's magnitude or 
other factors. In general, it is preferable to begin with the segment that is considered to 
be the most likely to be a real contour segment (e.g. not just noise), so that the actual 
contour segments are given a chance to grow before notee segments. Such ordering is 
30 not necessary, but it can produce more robust results. 
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In step 335, the hash values of the endpoints of the seed segment are used to 
find candidate segments for joining using the spatial hash table; all segments whose 
hash values are in a 3x3 bin neighborhood of the seed hash values are initially 
considered as candidates for joining. Note that there may be either one or two seed 
5 hash values, depending on whether both endpoints of the seed segment fall into the 
same hash bin. Also note that, if there are two seed hash values and they are in 
adjacent bins, then the two 3x3 neighborhoods overlap, and the method may take 
advantage of this to avoid duplicating access to the hash table. If contour segments 
also have polarity information (e.g. light-on-dark or dark-on-light, as might be 
10 determined using some oriented filters), then one might also choose to reject any 

candidates whose polarity differed from that of the seed segment. Typically, the use of 
the hash table can reduce the number of candidates from a few hundreds or thousands 
to just one or a very few. 

Processing then proceeds, via off-page connectors 337 and 338, to step 340, 
•11 where the precise endpoint-to-endpoint distance between the seed segment's endpoints 
[J and each candidate segment's endpoints is computed, and any candidates whose 
j distance exceeds the allowed gap distance is removed from the set of candidates. In 
ry this way, the method removes slightly more distant candidates that may have been 
W included simply because of the quantizing nature of the spatial hash table. Note that, 
§ because a segment has two endpoints, there are four possible distances one may 
** compute between two segments; the minimum of these four is taken to be the distance 
between the segments. One may also use an allowed gap distance that varies, such as 
one that is a function of the two segments under consideration. For example, one might 
want to have a maximum gap distance of 50 pixels, but limit the actual allowed gap 
25 distance to be the length of the longer of the two segments, in order to decrease the 
chance of connecting two unrelated segments. 

Next, in step 345, the candidates are sorted by their minimum endpoint-to- 
endpoint distance from the seed segment, with the closest segments at the beginning. 
This sorting is done to facilitate greedily growing the seed segment and following the 
30 most likely contour, and generally one should consider the most likely matches first. 

20243/25 ^ 4 
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One might also consider all matches and then choose the best. For example, in some 
applications it might be better to consider candidate segments whose angles matched 
most closely first, instead of those whose endpoints were closest first. 

In step 350, contour constraints are applied to decide 355 whether to actually join 

5 a given candidate segment with the seed segment. For example, the angle between 
the two segments might be compared, and the candidate would be accepted only if the 
angles were fairly close, e.g. within 30 degrees, and the total RMS fit error to a single 
line of all the points of the two segments were below some threshold, e.g. 0,5 pixels. 
If the decision is made to join the candidate segment with the seed segment, 

10 then the joining is performed in step 360; otherwise, processing proceeds with step 365. 
The joining typically involves updating the seed segment (e.g. to have new endpoints, 

: ; RMS fit error values, curvature values, etc M depending on the nature of the segments), 
removing the candidate segment from any further consideration (e.g. for joining with 

o 

ty other potential seed segments or for being considered as a seed segment itself) and 
if from the spatial hash table, and updating the seed segment's hash values (e.g. because 
J its endpoints changed). Note that joining does not always imply endpoint-to-endpoint 

5 - 5 

w connection, and other techniques might be more appropriate depending on the nature of 
M the contour segments and the desired contours. Processing then proceeds with step 
[Jj 365, where a determination is made as to whether there are more candidates remaining 
it to be considered for joining. 

fl In step 365, if there are more candidates, then the next candidate is examined in 

step 370 and processing returns to step 350; if there are no more candidates, 
processing continues with step 375. In step 375, a determination is made whether any 
candidates were joined to the seed segment (during any iteration of step 360). if such 

25 joining ever occurred, then the seed segment may have altered in such a way that there 
may now be additional segments that would be candidates for joining With it, so the 
process returns, via off-page connectors 377 and 378, to step 335 to find new 
candidates. 

If no joining was done, then the process proceeds with step 380, where a 
30 determination is made whether there are more segments that should be used as a seed 
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segment. Typically, any remaining segment which has not yet been considered as a 
seed segment should be used; note that any segments that were joined to another 
segment were already removed from consideration in step 360. If there is another 
potential seed segment, then processing returns, via off-page connectors 382 and 383, 
5 to step 330; otherwise, processing continues with step 385. 

In step 385, a determination is made whether to continue processing with a new 
gap size and/or new constraints. For example, one may wish to progressively build up 
larger more reliable contour segments by first using a relatively small gap size to form 
larger contour segments that are more reliable than the initial chains, discarding 
10 segments that are too short or have low magnitude, and then using a larger gap size to 

find further connections among the new segments. One might also perform an initial 
\a iteration that enforces linear constraints to find long line segments and then perform a 
p second iteration that allows curvature or bending when connecting those line segments. 

W In an illustrative embodiment for fiber end-face inspection, three passes may be used. 

PI 

M The first pass uses a gap distance of ten pixels and uses linear constraints, after which 
S all line segments shorter than ten pixels are discarded. The second pass uses a gap 
= distance equal to the radius of the fiber cladding and uses linear constraints. Finally, a 

third pass uses a gap distance of ten pixels and allows significant bends (up to 90 
W degrees) between line segments; the final contours are simply represented as the eet of 
p| connected line segments. As a result of the techniques used for polishing fiber end 
H faces, most scratches tend to appear as long straight lines, although some are curves 
with a large radius of curvature and some have one or two sharp kinks. Thus the 
multiple-pass approach tries to build up the most likely contours first. 

By varying the rules to accept connections, alternative embodiments of the 
25 invention can accept various contours such as curves and splines that do not 

necessarily have a particular shape. This overcomes a substantial limitation of Hough 
transform methods, which do require advance knowledge of contour shapes in order to 
detect them. Additional more complex connectivity rules can also be used to avoid 
creating certain unusual intersection shapes. 
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If, in step 385, a decision to use a new gap size and/or constraints is made, then 
the process returns, via off-page connectors 387 and 388, to step 325, where a new 
spatial hash table is constructed, if necessary. If no new gap size or constraints are to 
be used, then the process finishes with step 390. 

5 Although illustrative embodiments of the invention are described herein in terms 

of finding scratch features in images of optical fiber end-face surfaces, persons of 
ordinary skill In the art should appreciate that the method according to the invention can 
be practiced to locate any number of features in image data. Furthermore, although the 
illustrative embodiments are described herein in terms of combining linear sub-chains to 

10 form contours, persons of ordinary skill in the art should recognize that sub-chains may 
be defined with any number of qualifying characteristics (other than mere linearity) as 
basic building blocks of alternative contours without departing from the scope of the 

fee; 

P present invention. 

y Although the various embodiments of the invention are described herein in terms 

IS of a hash table having uniform square bins, persons of ordinary skill in the art should 
Jl appreciate that the Invention can be practiced using any of various configurations of 
W bins, including rectangular bins and bins of varying dimension and shape across an 
| s , image. For example, hexagonal bins might be used. Similarly, hash similarity 
j 1 ! neighborhoods other than a square 3x3 bin neighborhood may be used in alternative 

|® embodiments of the present invention; for larger neighborhoods, one might even 

n 

™[ consider other neighborhood shapes, such as neighborhoods that approximate circles 
rather than squares. 

Although the local neighborhood for considering chaining true peaks in illustrative 
embodiments of the invention is described in terms of the three-by-three pixel area 

25 centered on a pixel of interest, persons of skill in the art should appreciate that 

neighborhoods of different dimensions can also be considered without departing from 
the spirit and scope of the invention. Also, although the local chaining technique is 
described as requiring edgelete to b© adjacent in a 3x3 pixel area, persons of skill in the 
art should appreciate that such a strict adjacency requirement may be relaxed, 

30 especially when using a larger neighborhood size. 
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A software implementation of the above-described embodiment may comprise a 
series of computer instructions either fixed on a tangible medium, such as a computer 
readable medium, e.g. a diskette, a CD-ROM, a ROM memory, or a fixed disk, or 
transmissible to a computer system, via a modem or other interface device over a data 
5 communications medium. The medium either can be a tangible medium, including, but 
not limited to, optical or analog communications lines, or may be implemented with 
wireless techniques, Including but not limited to microwave, Infrared or other 
transmission techniques. It may also be the Internet. The series of computer 
instructions embodies all or part of the functionality previously described herein with 

10 respect to the invention. Those skilled in the art will appreciate that such computer 

instructions can be written in a number of programming languages for use with many 
m computer architectures or operating systems. Further, such instructions may be stored 
ji; using any memory technology, present or future, including, but not limited to, 
W semiconductor, magnetic, optica! or other memory devices, or transmitted using any 
j| communications technology, present or future, including but not limited to optical, 
w infrared, microwave, or other transmission technologies. It is contemplated that such a 
T" computer program product may he distributed as removable media with accompanying 
• j printed or electronic documentation, e.g., shrink wrapped software, pre-loaded with a 
[J computer system, e.g., on system ROM or fixed disk, or distributed from a server or 
M electronic bulletin board over a network, e.g., the Internet or World Wide Web. 
U Although an exemplary embodiment of the invention has been disclosed, it will 

be apparent to those skilled in the art that various changes and modifications can be 
made which will achieve some of the advantages of the invention without departing from 
the spirit and scope of the invention. For example, although an embodiment for fiber 
25 end-face inspection was discussed, those skilled in the art will appreciate that the 

Invention can be applied to a great many other applications, such as inspection of laser 
diodes or lenses, or inspection of automobile paint finish, or automatic generation of a 
geometric description of an object in an image for use in alignment. Use of any such 
embodiment for the purposes described herein is within the spirit and scope of the 
30 invention. Other aspects, such as the specific instructions utilized to achieve a 
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particular function, as well as other modifications to the inventive concept are intended 
to be covered by the appended claims. 
What is claimed is: 
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